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Abstract. We consider a model where the failure hazard function, conditional on a time- 
independent covariate Z is given by R{t,d^\Z) = ri^o{t)f0o{Z), with 6° = (/3°,7°)"^ G R^+f . 
The baseline hazard function rj^o and relative risk /^o belong both to parametric families. 
The covariate Z is measured with an error through an additive error model U = Z + e 
where e is a random variable, independent from Z, with known density f^. We observe a 
n-sample {Xi, Di,Ui), i — 1, . . . ,n, where Xi is the minimum between the failure time and 
the censoring time, and Di is the censoring indicator. We aim at estimating in presence 
of the unknown density g of the covariate Z using the observations {Xi, Di,Ui), i = 1, . . . ,n. 
Our estimation procedure based on least squares criterion provide two estimators of 6^. The 
first one is obtained by minimizing an estimation of the least squares criterion where g is 
estimated by density deconvolution. We give upper bounds for its risk that depend on the 
smoothness properties of and as a function of z. We derive from this construction 

sufficient conditions that ensure the y^-consistency. The second estimator is constructed 
under conditions ensuring that the least squares criterion can be directly estimated with the 
parametric rate. We propose a deep study of examples considering various type of relative 
risks ffs and various types of error density f^. We show in particular that in the Cox model 
and in the excess risk model, the estimators are -yn-consistent and asymptotically Gaussian 
estimators of 6" whatever is f^. 

Abstract. Considerons un modele a risque instantane modelise par la relation R{t, 6'^\Z) = 
'ri^o{t)ffjo{Z), ou 6*" — (/3'',7'')^ G R'""'"^. Le risque de base rj^o et la fonction de risque 
relatif f^o appartiennent a des families parametriques. La covariable Z est mesuree avec 
une erreur au travers de la relation U = Z + e, e etant une variable aleatoire, independante 
de Z, de densite connue f^. Nous disposons d'un n-echantillon {Xi, Di, Ui), i = 1, . . . ,n o\x 
Xi est le minimum entre le temps de survie et le temps de censure et Di est I'indicateur 
de censure. Notre but est d'estimer S", en presence la densite inconnue g, de la covariable 
Z, en utilisant les observations {Xi, Di,Ui), i — l,...,n. Notre methode d'estimation, 
fondee sur le critere des moindres carres nous fournit deux estimateurs. Pour le premier, 
nous etablissons des bornes superieures du risque dependant des regularites de la densite des 
erreurs et de la fonction de risque relatif, comme fonction de z. Nous en deduisons des 
conditions sufBsantes pour atteindre la vitesse parametrique. Le deuxieme estimateur est 
construit sous des hypotheses assurant que le critere des moindres carres pent etre estime a 
la vitesse parametrique. Au travers d'exemples, nous etudions les proprietes des estimateurs 
ainsi que les conditions assurant la -yn-consistance pour des fonctions de risque relatif et des 
densites d'erreurs varices. En particulier, dans le modele de Cox et dans le modele d'exces de 
risque, les estimateurs construits sont y^n-consistants et asymptotiquement gaussiens, quelle 
que soit la loi des erreurs e. 

Key Words and Phrases: Semiparametric estimation, errors-in-variables model, 
nonparametric estimation, excess risk model, Cox model censoring, survival analysis. 
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1. Introduction 
In a proportional hazard model the hazard function is defined by 

(1.1) R{t,e^\Z)=r]^o{t)fpo{Z), 

where rj^o is the baseline hazard function and /^o is the relative risk, i.e. the risk associated 
with the value of the covariate Z and relative to the risk under standard condition given 
by //3o(0) = 1. In this paper we consider general relative risk with a special interest in 
f^{z) = exp{f3z) and f^{z) = 1 + f3z which define respectively the Cox model and the model 
of excess relative risk. The functions r]^o and /^o belong both to parametric families and 
6^ = {P^,'-f^)~^ belongs to the interior of a compact set = B x F C W^~^p. To ensure that 
the hazard function is a positive function, we assume that both are positive functions. 

We are interested in the estimation of 9^ when the covariate Z is measured with error. 
If Z were measured without error, we would consider a cohort of n individuals during a 
fixed time interval [0,r]. For each individual, we would observe a triplet {Xi, Di, Zi), where 
Xi = min (Tj,Cj) is the minimum between the failure time Tj and the censoring time Cj, 
Di = lTi<Ci denotes the failure indicator, and Zi is the value of the covariate. In this paper we 
consider that the covariate is mismeasured. For example the covariate Z is a stage of a disease, 
not correctly diagnosed or a dose of ingested pathogenic agent, not correctly evaluated, so 
that the error range between the unknown dose and the evaluated dose is sizeable. In this 
context, the available observation for each individual is the triplet Aj = (Xi, Di, Ui) where Ui 
is an evaluation of the unobservable covariate Zi. The random variables U and Z are related 
by the error model defined by 

(1.2) U = Z + e, 

where e is a centered random variable, independent of Z, T, and C. The density of e is 
known and denoted by fs. Our aim is thus to estimate the parameter 6^ = {(3^,"f^)~^ from 
the n-sample (Ai,...,A„) in the presence of the unknown density g of the unobservable 
covariate Z, seen as a nuisance parameter belonging to a functional space. 

1.1. Previous known results and ideas. Models with measurement errors are deeply 
studied since the 50's with the first papers of Kiefer and Wolfowitz (|1956|) and Reiers0l 
(|W50,) for regression models with errors-in-variables. We refer to Puller H1987() and Carroll et 
al. H1995|) for a presentation of such models and results related to measurement error models. 
The interest for survival models when covariates are subject to measurement errors is more 
recent. 

To take into account that the covariate Z is measured with error, the first idea is simply 
to replace Z with the observation U in the score function defined by 



(1.3) L<')(A z(">) ='-tr( 4^ - ^ffSu! ) ""'f'^' 

n~lJo \fpiZi) 22j=iYj[t)fi3iZj) I 

where Ni{t) = lxi<t,Di=i, Yiit) = 1xi>t, = {Zi,...,Zn), and where fj^^ is the first 
derivative of fp with respect to (3. We refer to Gill and Andersen (1982) for futher details on 
()1.3() . This method, named the naive method, is known to provide, even in the Cox model, 
a biased estimator of f3^. This comes from the fact that 

lim E[lW(/3°, [/("))] / lim E[lW(/30^ z^"))] = 0. 

n— >oo n^oo 

To our knowledge, all previously known results about consistency for the semi-parametric 
estimation of the hazard function when the covariate is mismeasured are obtained in the 
Cox model. Let us present those results. Various authors propose estimation procedures 
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based on corrections of the score function L„ f/^"^). Among them, one can cite Kong 
(|1999|) who calculates the asymptotic bias of the naive estimator obtained by minimization 
of Ln\f3^ ,11^"^^), and defines an adjusted estimator. His estimator is not consistent, but a 
simulation study indicates that it is less biased than the naive estimator. In the same context, 
Buzas H1998|) proposes an unbiased score function, and shows throughout a simulation study 
that his method yields to an estimator with a small bias. Following the approach developped 
first by Stefanski H1989|) and Nakamura (,1990.) for generalized linear models, Nakamura ( 1992|) 

constructs an approximately corrected partial score likelihood, defined by L^i^^(/3, + 
a'^PN^r), where N{t) is the number of failures in the interval [0, r] and where e is a centered 
Gaussian random variable with variance o"^. Under the error model defined in (|1.2jl . this 
correction is based on the facts : 

(1.4) lim E[4}\l3, only depends on E(Z) and E[exp(/3Z)], 

n— >cxD 

(1.5) E{Z) = E{U) 

(1.6) E[exp(/3C/)] = E[exp(/?Z)]E[exp(/3e)]. 

Kong and Gu H1999|) prove that the Nakamura (|1992|) 's estimator is a -y/n-consistent and 
asymptotically Gaussian estimator of One can also cite Augustin H2UU4() who proposes 
an exact correction of the log-likelihood function. 

Again in the Cox model, an extension of the previously mentionned works is presented 
in Hu and Lin 1)2002(1 . They obtain a broad class of consistent estimators for the regression 
parameter when U is measured on all study individuals and the true covariate is ascertained 
on a randomly selected validation set. A nonparametric correction approach of the partial 
score function is also developped by Huang and Wang (2000)) when repetitions are available. 

We point out that those results strongly depend on the exponential form of the relative risk 
of the Cox model, through the use of (jl.4() - ()l.()j) and the extension of such methods to other 
relative risks is not concluding. For instance, in the model of excess relative risk without 
errors, the hazard function is defined by R{t, 9^\Z) = r]^o{t){l + P^Z) and the score function 
is given by 

In this model, the naive method also provides biased estimator of (3^, since 
lim E[L«(/30, [/("))] / lim E[L«(/30, Z^"))] =0. 

n— »oo n— +00 

Indeed, easy calculations combined with the Law of Large Numbers give that the limit 
lim„^oo E[L'n\l3, Z("))] depends on E[Z/{1 + (3Z)] whereas the limit lim„^oo E[Li^^(/3, ?/("))] 
depends on E[C//(1 + PU)]. Since the error model ()1.2|) does not provide any expression 
of E[Z/(1 + PZ)] related to E[U/{1 + PU)], corrections analogous to the ones proposed in 
the Cox model cannot be exhibited. In other words, it seems impossible to find a function 
^{P, U), independent of the unknown density g satisfying that E{^{P, U)) = E[Z/(1 + PZ)]. 
Consequently the methods proposed in the Cox model, by Nakamura (1992), Kong and Gu 
(fTT?99;) . Buzas (fTT^ . Lin Huang and Wang pTO]) or by Augustin (2004) cannot be 

applied to the model of excess relative risk and a fortiori to a model with a general relative 
risk. As a conclusion, methods based on a correction of the partial score likelihood (jl.l-ij) 
where Z is replaced with U seem not concluding in a general setting. 

An other possible way is to consider a partial log-likelihood related to the filtration gen- 
erated by the observations. If the covariate Z were observable, then the filtration at time 
t, generated by the observations would be a{Z, N{s),lx>s,0 < s < t < r}, and the in- 
tensity of the censored process N{t) with respect to this filtration would equal X{t,9^,Z) = 
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r]^o{t)Y{t)fpo{Z). In case of covariate measurement error, Z is unobservable and only the 
evaluation U is available. In this context, the filtration generated by the observations is 
£t = (t{C/, iV(s), Ix>si < s <t < r}, and the intensity of the censored process N{t) with 
respect to the filtration £t equals 

nKt,0^,Z)\8t]=r^^n{t)Ym[fpo{Z)\a{U,lT>t)]. 

This is studied in the Cox model by Prentice (|1982j) who proposes the following induced 
failure hazard function 

r/^o(t)E[exp(/30z)|f7(lT>t,C/)]. 
The presence of {T > t} in the conditioning usually implies that the induced partial log- 
likelihood has not explicit form. When the marginal distribution of Z given {T > t,C/} is 
specified at each time t, Prentice (|1982|1 proposes an approximation of the induced partial 
log-likelihood independent of the baseline hazard function. Nevertheless this approximation is 
appropriate only when the disease is rare. Tsiatis et al. ()1995() propose another approximation 
of the induced partial log-likelihood, but emphasise that their method cannot be used for the 
model of excess relative risk. 

In the Cox model with missing covariate Pons (|2()()2|1 uses also the partial likelihood. 
She proposes an estimator based on where E[exp(/3Z)|(T([/, lT>t)] is replaced with 

E[exp(/3Z)|[/] by considering that 

(1.7) E[exp(/3Z)|f7(t/, lT>t)] = E[exp(/3Z)|C/]. 

Nevertheless, obvious examples can be exhibited to prove that the equality (|1.7|) does not 
hold in a general setting. As a conclusion, the partial likelihood related to the filtration £t 
seems unusable since it is difficult to separate the estimation of (3^ from the estimation of 

1.2. Our results. Our estimation procedure is based on the estimation of least squares cri- 
terion using deconvolution methods. More precisely, using the observations Aj = (Xi,Di, Ui) 
for i = 1, . . . , n, we estimate the least squares criterion 

(1.8) 5eo,g(^) = E {^fl{Z)W{Z) 1^ Y{t)r^^^{t)dt^ - 2E l^fp{Z)W{Z) l^^{t)dN{t)^ . 

The function PV^ is a positive weight function to be suitably chosen such that W fp and its 
derivatives up to order 3 with respect to (3 are in Li(M)nL2(M) and have the best smoothness 
properties as possible, as functions of z. Under reasonable identifiability assumptions, SgO g{6) 
is minimum if and only if = 6^. We propose to estimate Sgo^g{9) for all ^ G O by a quantity 
depending on the observations Ai, • • • , A„, expecting thus that the argument minimum of 
the estimator converges to the argument minimum of SgO g{0), say 9^. 

We propose a first estimator of 9^, say 9i, constructed by minimizing Sn,i{9), a consistent 
estimator of Sgo^g where g is replaced by a kernel deconvolution estimator. We show that 
under classical assumptions, this estimator is a consistent estimator of 9^. Its rate of conver- 
gence depends on the smoothness of fs and on the smoothness on W{z)fi3{z), as a function 
of z. More precisely, its depends on the behavior of the ratios of the Fourier transforms 
iWfi3)*it)/J*{t) and {Wf'^)*{t)/J*{t) as t tends to infinity. We give upper bounds for the 

risk of 9i for various relative risks and various types of error density and derive sufficient 
conditions ensuring the -^/n-consistency and the asymptotic normality. These upper bounds 
and these sufficient conditions are deeply studied through examples. In particular we show 
that 9i is a -y/n-consistent asymptotically Gaussian estimator of 9^ in the Cox model, in the 
model of excess relative risk, and when is a general polynomial function. 

The estimation procedure is related to the problem of the estimation SQ0 g(9). Under condi- 
tions ensuring that it can be estimated at the parametric rate, we propose a second estimator 
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02 which is y^-consistent and asymptotically Gaussian of 9^. Clearly, these conditions are 
not always fullfilled and O2 does not always exist, whereas 9i can be constructed and studied 
in all setups. 

The paper is organized as follows. Section 2 presents the model and the assumptions. In 
Sections 3 and 4 we present the two estimators and their asymptotic properties illustrated in 
Section 5. In Section 6, we comment the use of the least squares criterion. The proofs are 
gathered in Section 7 and in the Appendix. 

2. Model, assumptions and notations 

Before we describe the estimation procedure, we give notations used throughout the paper 
and assumptions commonly done in survival data analysis. 

Notations For two complex- valued functions u and v in L2(M) n Li(R), let 

u*{x) = J e^^^u{t)dt, u-kv{x) = j u{y)v{x — y)dy, and <u,v>= j u{x)v{x)dx 

with z the conjugate of a complex number z. We also use the notations 

ll^lli = / \u{x)\dx^ 11^11^= / \u{x)\'^dx, ||n||oo = sup 



and for 9 G 



k=l 

For a map 

ife ■■ X R — > R 
{9,u) ^ ifeiu), 

whenever they exist, the first and second derivatives with respect to 9 are denoted by 
= with^g(.) = ^|^foriG{l,...,m+p} 

and v^f(-) = with .^£^0 = ^^, for ^ 

Throughout the paper P, E and Var denote respectively the probability, the expectation, and 
the variance when the underlying and unknown true parameters are 9^ and g. Finally we use 
the notation a_ for the negative part of a, which equals a if a < and otherwise. 
Model assumptions 

(Ai) The function ry^o is non-negative and integrable on [0,r]. 
(A2) Conditionnally on Z and U, the failure time T and the censoring time C are 
independent. 

(A3) The distribution of the censoring time C, conditional on Z and U, does not 
depend on Z and U. 

(A4) The distribution of the failure time T, conditional on Z and U, does not 
depend on U. 

These assumptions are usual in most frameworks dealing with survival data analysis and 
covariate measured with error, see Andersen et al. f;1993iil . Prentice and Self (T983j. Pren- 
tice dHHSJ), Gong (|199U|) and Tsiatis p995|) . Assumptions (j^^) and (jX^) state that a general 
censorship model is considered, where the censoring time has an arbitrary distribution in- 
dependent of the covariates. Assumption ( IA4I) states that the failure time is independent 
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of the observed covariate when the observed and true covariates are both given, i.e. the 
measurement error is not prognostic. 
We define the filtration 

J't = 'y{Z,U,N{s),lx>s.Q<s<t<T}. 

The intensity of the censored process N{t) with respect to the filtration equals 

(2.1) \{t,e\Z) = 7^^,{t)Y{t)fp,{Z). 

It follows from (|2.1|) and from the independence of the observations Aj, that for the indi- 
vidual i the intensity and the compensator process of the censored process Ni{t) = IXi<t,Di=i 
with respect to the filtration J^t are respectively 



(2.2) A,(t,0°,Z,) =r/^o(t)y,(t)V(Z,) and Ai(t,0O,Zi) = / h{s,e\Z, 

Jo 



)ds. 



Moreover the process Mj(t) = Ni{t) — Aj(t, 9^, Zi) is a local square integrable martingale. As 
a consequence, the least squares criterion defined in H1.8() can be rewritten as 







2.3) Se.,,{e) = / E {r^^{t)fp{Z)-ri^.{t)fp.{Z)fY{t)W{Z) 



dt 



E 







{il^o{t)fp.{Z)fY{t)W{Z) 



dt. 



Since we consider general relative risk functions we assume the below minimal smoothness 
conditions with respect to 9. 
Smoothness assumptions 

(As) The functions P ^ and ^ rj^ admit continuous derivatives up to order 3 
with respect to /3 and 7 respectively. 

:W IQ\ ^^A c(2) 



We denote by S^o {&) and 5^o {6) the first and second derivatives of Sqo g{9) with respect 

a t 

Identifiability and moment assumptions 



(2) 

to 9. For all t in [0, r], set S^g {9, t) the second derivative of SgO g when the integral is taken 

(2) (2) 

over [0,t], with the convention that Sgo^{9) = Sqo^{9,t). 
lity and moment asi 

(As) S^l^gi^) = if and only if 9 = 9^. 

(A7) For all t G [0,r], the matrix sil^ (S^^t) exists and is positive definite. 



(As) The quantity E(/|(Z)VF(Z)) is finite . 

(Ag) For i = !,••• ,m, E\ffso{Z)f^^o]^{Z)W{Z)f, E\ffso{Z)W{Z)f are finite. 

We denote by Q the set of densities g such that the assumptions ( |A2D -( r^^ ,( |A6D -( |A9D 
hold. 

3. Construction and study of the first estimator 9i 
3.1. Construction. If the Zi's were observed, Sgo^g{9) would be estimated by 



2 



(3.1) Sn{9) = -^J2jfs{Zi)W{Zi) J^rj^{t)dNi{t) + -J2fl{Zi)W{Z,) / 7j-'^im{t)dt 
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and 6^ would be estimated by minimizing Sn{6)- Since the Zj's are unobservable and Zi 
independent of £i, the density h of Ui equals h = g* f,.. We thus estimate SgO g by 



(3.2) 



Y,iff3W)*Kn,cAU^) / v^it)dN,it)+-Y,{fpW)*Kn 

1=1 i=l 



n 



Sn,i{e) = — yifpW)^Kn,cSUi) / v^{t)dN,{t)+-yifiw)*Kn,cAUr) / vmYi{t)dt, 







where i^n,c„(") = CnKn{Cn-) is a deconvolution kernel defined via its Fourier transform, such 
that / Kn{x)dx = 1, and 

(33) K* (t) ^"^^^ K*^/Cn) 

with iT* compactly supported satisfying |1 — K*{t)\ < ^\t\>i and ^ oo as n — > cx). 
Using this criterion we propose to estimate 9^ by 

(3.4) ^1 = (lA = arg min Sn.i(.0). 

We require for the construction of Sn,i{0), that 

(Aio) the density fe belongs to L2(M) n Loo(M) and for all x G M, f*{x) 0. 

3.2. Asymptotic properties of the first 9i. Assume that 

(All) sup II fLg \\l< C2{flo), sup II fpog \\l< C2(//3o). 

see see 

(A12) sup(Ty//3), and sup(VF/|) belong to L] 



(A13) ^MWff^) and sup(H^/^4')) belong to Li( 



As in density deconvolution, or for the estimation of the regression function in errors-in- 
variables models, the rate of convergence for estimating 9^ is given by both the smoothness 
of fe and the smoothness of {fpW){z), and d{fpW){z) / df), as functions of z. The smoothness 
of the error density is described by the decrease of its Fourier transform. 

(A14) There exist positive constants C(/e),C(/e), and nonnegative 5, p^a and uq such 

that Qife) < \ f*e{u)\ |'urexp(5|n|0 < C{fe) for all |n| > uq. 

If /> = 0, by convention (5 = 0. When p = Q = 5 m. ( IA14D , is called "ordinary smooth". 
When 5 > and p > 0, it is called "super smooth". Densities satisfying GA14D with p > and 
(5 > are infinitely differentiable. The standard examples for super smooth densities are the 
Gaussian or Cauchy distributions which are super smooth of respective order a = 0, p = 2 and 
a = 0, p = 1. For ordinary smooth densities, one can cite for instance the double exponential 
(also called Laplace) distribution with p = Q = 5 and a = 2. We consider here that < p < 2. 
The square integrability of fe in dAioD requires that a > 1/2 when /> = in (IA14D . 
The smothness of f^W is described by the following assumption. 

(A15) There exist positive constants L{f),L{f) and a,d,uo,r nonnegative numbers such 
that for all /? E B, fjsW and f'^W and their derivatives up to order 3 with 
respect to /?, belong to 

(3.5) na,d,r = {/ G Li(M);L(/) < \ f* {u)\\u\'' eMdH') < L{f) < 00 for all |n| > uq}. 

If r = 0, by convention d = 0. 
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Theorem 3.1. Let ^K^-^K^ hold. Let Oi = ^i(C„) he defined by B^) and with Cn 
a sequence such that 



(3.6) 



^{2a-2a+i-p+(i-p)^) exp{-2dC^ + 2(5C^}/n = o(l) as n ^ +oo. 



1) Then E(|| ^i(C„) — 9 = o(l), as n —> cxd and ^i(C„) is a consistent estimator of 9 . 

2) Moreover, E(|| 9^ - 9^ \\%) = 0{^l) with = \\{ipn,M% , = Bl^{9^) + 



where Bl -{9^) 



2 
9 



11 



+ 



+ ll(/^oVFr(K5„-i)li; + 

2 



+ 



+ 



/I 



+ 



ft 



3) Furthermore, the resulting rate if"^ is given in Tabled 

We point out that the rate for estimating depends on the smoothness properties of 
d{W fi3){z)/d(3 and d{W f'^){z)/df3 as a function of z, whereas, the rate for estimating 7*^ 
depends on the smoothness properties of Wffs{z) and Wf'^{z) as a function of z. In both 
cases, the smoothness properties of function of t does not have influence on the rate 

of convergence. 

The terms B'^ ■ and Vn,j are respectively the squared bias and variance terms. As usual ,the 
bias is the smallest for the smoothest functions {W f0){z) and d{fii,W){z) / 8(3., as functions 
of z. As in density deconvolution, the biggest variance are obtained for the smoothest error 
density /g. Hence, the slowest rates are obtained for the smoothest errors density /g, for 
instance for Gaussian e's. 

The rate of convergence of the estimator 9i could be improved by assuming smoothness 
properties on the density g. But, since g is unknown, we choose to not assume such properties. 
Consequently, without any additional assumptions on g, the parametric rate of convergence 
is achieved as soon as {W fp) and {Wf'^) and their derivatives, as functions of z, are smoother 
than the errors density f^. 

3.3. Consequence : a sufficient condition to obtain the parametric rate of conver- 
gence with 9i. 



There exists a weight function W such that the functions 



snp{fpWr/f* , sup(/^Ty)7/| belong to Li 



I3& 



The functions sup(/^^V) /f* and sup ( /^'^^VF) 



f3& 



■(1). 
'/3 



belong to Li(M) n L2 



(C3) 



The functions ifg^W) /f* and 



) //I 



belong to Li(M) n L2(M) for all /? G 
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p = in ( IA14I I 

ordinary smooth 



p > in ( IA14I I 

super smooth 



Wfpo 



d = r = 
in flAistt 

Sobolev 



a < a + 1/2 n 



a>a+l/2 rr^ 



(logn)" 



r > 
in ( lAisl l 



r < p 



(logn)^('"-'->'')exp{-2d(l^)'-/^} 



d < 5 (log n) A(a,r,p) + 2<.d/{ar)^-d/5 

d = 5, a<ci+l/2 (logn) 

d = 5, a > a + 1/2 n"^ 
d > (5 



r = p 



r > p 



where A{a,r, p) — (— 2a + 1 — r + (1 — r)_)/p. 
Table 1 . Rates of convergence (f'^ of 6i 



Theorem 3.2. Let ^A^-^A^^ and (fCilj-ilCaP hold. Then Oi defined by ^[3^ is a 
consistent estimator of 6^ . Moreover 



where Si equals 
(3.7) 

d\UpW)[Z)n,{s)) 



with 
So,i = E J 











502 

502 



1,^,0 dN{s) + 



92((/2H^)(Z)r?2(.)) 



502 



de 



\e=e^^dN{s)+ [ 
Jo 



de 



Y{s)ds 



T 
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where 



p—itU r p—itU 

Rpj,,i{U)= j{Wfpr{t)=^dt and R^j^^U) = J {Wflnt)=^dt. 



The conditions dCiD - jns] ), stronger than ( IA14I ) and ( IA15D , ensure the existence of the 
functions Rpj^j for j = 1, 2. 



4. Construction and study of the second estimator 62 

4.1. Construction. Our estimation procedure, based on the estimation of the least squares 
criterion, requires the estimation E[£ {ff3W){Z)dN{t)] and E[£ {f^W){Z)Y{t)dt], which 
are hnear functional of g. It may appear that these linear functional could be directly esti- 
mated, without kernel deconvolution plugg-in. In this context, we propose another estimator 
of 9^. It is based on sufficient conditions allowing to construct a y^-consistent estimator of 
these linear functionals and hence to estimate SgO g with the parametric rate. 

We say that the conditions ([UlJ'l^il hold if there exist a weight function W and two 
functions ^pj^^i and $/3j^,2 not depending on g, such that for all /? G B and for all g 



(C, 



and E, 







E, 



ifpW){Z)dN{t) 

{flW)iZ)Yit)dt 







<^pj^,i{U)dN{t) 

^p,fMU)Y{t)dt 



For k = 0,1,2 and for j = 1, 2, E[sup AU)\\e2] < 00; 



(Ce) For j = 1, 2 and for ah /3 E B, E 

Under ( |C4D -( [n6] ), we estimate SgOg by 



< 00. 



(4.1) Sn,2{0) = --iZ I '^PJe,imr]^it)dNiit) + -y2 [ '^PJe,2iUi)Vjit)Yiit)dt 
" i=l " i=l -^0 



and 9^ is estimated by 
(4.2) 



62 = argminS'„,2(6')- 



The main difficulty for finding such functions ^pj^^i and ^i3j^^2 lies in the constraint that 
they must not depend on the unknown density g. We refer to Section f4. 31 for details on how 
to construct such functions ^^j^j, j = 1,2. 

4.2. Asymptotic properties of 92- 

Theorem 4.1. Let dAiD - fAgl ), and the conditions ([C4D - IC6D hold. Then 92, defined by i4-S\ ) 
is a y/n- consistent estimator of 9^ . Moreover 



V^(e2 - 9^ 



AA(0,S2), 
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where E2 equals 
(4.3) 



r fi2 



xSo,2 \^ 

with 
So,2 = E \ 











e=eo dNis) + / 
Jo 



Y{s)ds 



le=0o diV(s) + \e=eoYis)ds 



do 

2/ 



a($/3,/„i(C/)r?^(s)) 



\e=eodN{s) + \e=eoYis)ds 



4.3. Comments on conditions ensuring y^-consistency : comparison of 61 and 



Let us briefly compare the conditions dCiD - JUat to the conditions ((^^J'^Eil- note- 
worthy that the conditions ([^^'"(Eil more general. First, the condition ( IC4D does not 



require that fpW, f^W belong to Li(]R) (as for instance in the Cox Model). Second, we point 



out that Condition ( |CiD implies ( IC4D , with $ 
under JUTld^, by denoting ^>/3,/„i 

]E[y(t)i>A/.,2(^)] = 



R 



. This comes from the facts that 




{Wf^r/f* and $ft/,,2 = iWf',r/fi, we have 
'^x>t^/3j,,2iu)fx,z{x,z)feiu - z)dx du dz 



Ix>t/x,z(x,^)— y ^lj^^^{s)e-~'''f*{s)dsdxdz 
' ^^^''.-'''fl{s)dsdxdz 



'S^x>tfx,z{x,z] 



2lT 



ny{t){wfi){z)]. 



Consequently 



E 



Jo 



^(,,f,,2iU)vlit)Y{t)dt 



E 



and analogoulsy 



E 



/3,/.,l 



{U)r]^it)dN{t) 







= E 









fl{Z)W{Z)if^{t)dt 



fp{Z)W{Z)r]^{t)dNit) 



Hence Condition ( IC4D holds and So,i = Ilo,2 with Sq^i defined in Theorem 13.21 

These comments underline the key importance of the weight function W. For instance, if 
fp{z) = 1 — /? + /3/(l + z^), and is the Gaussian density, then it seems impossible to find 
a function ^pj,,2 such that E[y(t)$^j^,2(C^)] = E[Y{t)f^{Z)], whereas ((nTJ-lQ hold by 
taking M^(z) = (1 + z^)'^ exp(— z^/(4(5)). In this special example, we exhibit a suitable choice 
of W that ensures that condition dCiD - jnaj ) are fulfilled (see Section El for further details). 
Nevertheless, such weight function are not always available and hence those conditions dCiD - 
are not always fulfilled. 
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5. Examples 

In this section, we illustrate the asymptotic properties of 9i and 62 for various relative risks 
and error density fs- In all of these examples, K*(t) = I|t|<i and the noise distribution is 
arbitrary, as far as it satisfies QAiqI ) and ( IA14D with < /? < 2. 

The first example deals with Cox model. We show that our estimation procedure, based 
on a nonparametric method and specifically on density deconvolution, also provides ^/n- 
consistent and asymptotically Gaussian estimator of f3^. The aim of this example is to show 
that we recover previous known results using estimators that are quite different from the ones 
proposed by Nakamura H1992|) and studied by Kong and Gu 1)1999(1 or from the ones proposed 
by Augustin f^MJ^ . 

The others examples we consider, deal with relative risks for which no consistent estimators 
were known when the covariate is mismeasured. 

Example 1. Exponential relative risk (Cox model) 

Let fp be of the form ff3{z) = exp(/5z) and assume that E[exp(2/3''[/)] < 00. Let W{z) = 
exp{— z^/(4(^)}. Then the conditions JCiD - dns] ) as well as the condition ( |C4| )-( [n6] ) are satis- 
fied. Consequently the estimators Oi and 62 are -y/n-consistent and asymptotically Gaussian 
estimators of 6^, with the same asymptotic variance. 

One could also choose 14^ = 1 and use that E[exp(/3Z)] = E[exp(/3[/)]/E[exp(/3e)]. This 
implies that if we denote by 

exp(2/3[/) , ^ exp(/3C/) 



E[exp(2/3e)] ' E[exp(/3e)] 

then E[$^j^,i(C/)] = E[/J(Z)] and E[y(t)$^j^,2(C/)] = E[y(t)/^(Z)], and the criterion Sn,i 
defined by (|4.1|1 exists. 

In this case Q2 is also a -^/n-consistent and asymptotically Gaussian estimator of (P . 



Example 2. Polynomial relative risk 1 (included Excess relative risk model) Let 

//3 be of the form jp{z) = 1 + X^^Li A-z'^ and let W{z) = ex.p{— z"^ / (45)} . Then conditions 
^[CiD - dUa] ) as well as conditions ( |C4D -( [n6] ) are satisfied. Consequently the estimators 9i and 
62 are y^-consistent and asymptotically Gaussian estimators of 6^, with the same asymptotic 
variance. 

We point out that when m = 1, fp{z) = 1 + Pz, and this model is known as the model of 
excess relative risk. 

One can also choose W = 1, provided that the kernel K has finite absolute moments of 
order m and satisfies J u^'K{u)du = 0, for r = 1,. . . ,m. With this choice of W, 61 remains 
a -y/n-consistent and asymptotically Gaussian estimator of 6^. 

Example 3. Cosines relative risk 1 Let /^j be of the form fi3{z) = ^Jli /3j cos(jz) 
with Ylf=iPj = 1- Let W{z) = exp{-zV(4(5)}. Then the conditions ([nTJ-lin^) as weh as 
conditions ( |C4D -( [C6] ) are satisfied. Consequently the estimators 9i and 62 are -^/n-consistent 
and asymptotically Gaussian estimators of 6^, with the same asymptotic variance. 

One can also choose W = 1 and use that E[exp(zjZ)] = ¥,[ex.p{ijU)]/K[ex.p{ije)]. This 
implies that if we denote by 



expjijU) exp{-ijU) 

fiU) HU) 
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and 



m 

2 



exp(2ijC/) exp(— 2ij?7) 

nm ^ Tern 



j=i 



exp{i{j + k)U) exp(-i(j + k)U) exp{i{j - k)U) exp(i(-j + k)U) 
+ HU + k) f*e{j-k) 7*ij-k) 



then the criterion Sn,2 defined in ()4.ip exists. 

With this choice of W^, 62 remains a -^/n-consistent and asymptotically Gaussian estimator 
of 9^. In the same way, 61 with W = 1 also remains -^/n-consistent and asymptotically 
Gaussian estimator of 6^. 

Example 4. Cauchy relative risk 1 Consider of the form f/3{z) = 1 — /3 + /3/(l + z^). 
Then ffj has the regularity of z 1/(1 + z^) which belongs to Ha,d,r defined in flAisP with 
a = 0, d = 1/2 and r = 1. Let W{z) = (1 + z^)^ exp{-zV(4(5)}. Hence the functions ff^W, 
f^W and their derivatives in (3 up to order 3 belong to Ti.a,d,r defined in ( IA15I ) with p < r = 2 
or p = r = 2 and d > 5. Consequently, the conditions dCiO - dUs] ) as well as conditions ( IC4I )- 
jCel ) are satisfied and the estimators 9i and 62 are -y/n-consistent and asymptotically Gaussian 
estimators of 6^, with the same asymptotic variance. 

This simple example underlines the importance of the smoothing weight function W in 
the construction of Oi or 62- Indeed, without a smoothing function W in front of the relative 
risk. Theorem 13 . II predicts a rate of convergence of order exp(— 2^1og n) for Gaussian e. 

Example 5. Laplace relative risk Consider of the form fpiz) = 1 + Pf{z) with f(z) = 
exp(— 12;|/2) — 1. Since the Fourier transform of z exp(— 12;|/2) is slowly decaying, like 
|u|~2 as |n| — > 00, if we choose W = 1, the estimator 61 is not -yn-consistent as soon as 
l/e (w)| < o(|u|~2) with |n| — > 00. A closer look tells us that and its derivative in f3 is C°° 
except at one point z = 0. Therefore, a proper choice of W can smooth out at and make 
Wff3, W f'p and their derivatives in (3 infinitely difFerentiable functions in z. This choice of W 
ensures the y^-consistency of 9i whatever satisfies GA14D with < p < 1. Even \i p> 1, 
the rate of 9i is much faster when using our choice of W then it would be for W = 1. Let us 
precise the choice of W. Set 



(5.1) ^a,b,r{^) = exp (^T^pp^^ ) ^[AB]{z), 

where — oo<yl<i?<cxD are fixed and R > 0. According to Lepski and Levit 1)19981) and 
Fedoryuk (fTW|l . p. 346, Theorem 7.3, ^ cexp(-C|ii|^/(^+^)), as \u\ 00 and 

c, C are positive constants. We propose to take W equal to ^'o,ioo,R or ^'-ioo,o,i? or their 
sum. 

This choice of W ensures that fpW, f'^W and their derivatives up to order 3 belong to 
T~(-a,d,r defined in ( IA15D with d > and r = R/{R + 1) closer to 1 as i? comes larger. 

If /e satisfies ( IA14I ) with < p < 1, we choose R large enough such that r = Rj (-R+ 1) > p. 
Hence, the conditions ( |CiD -( [np as well as the conditions ( |C4D -( [U6t are satisfied. Conse- 
quently the estimators 9\ and ^2 are y^-consistent and asymptotically Gaussian estimators 
of 0", with the same asymptotic variance. 
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If p > 1, for this choice of W, the functions Wfjs and and their derivatives in (3 up 
to order 3, belong to Ti.a,d,r with r = R/{R + 1) and hence, according to Tabled 

E II 01-9° 11^2= 0(1) (log n)^^^exp{-2d(logn/(25)f/^}. 

Example 6. Irregular relative risk Consider of the form f^{z) = 1 — /3 + /3I[_x,i] (-2) 
and take W = ^'-i,i,r defined by (inH} for i? > . 

If p = in ( IA14I ), then Conditions ( |Ci| -( [n3t as well as Conditions ([n^J'^Eil satis- 
fied. Consequently the estimators 61 and 62 are -y/n-consistent and asymptotically Gaussian 
estimators of 6^, with the same asymptotic variance. 

If /9 > 0, then the best rate for estimating 6^ is obtained by choosing W = ^'-i,i,_r with 
R > sufficiently large such that Wffs and Wf^ and their derivatives in (3 up to order 3, 
belong to Ti.a,d,r defined in ( IA15I ) with < r = R/{R + 1) < 1 as close to 1 as needed. 

It follows that if < p < 1, then we can find W = ^-i^i^r belonging to 'Ha,d,r with r = 
R/{R + 1) > p. Hence the conditions dCiD - JH^ as well as conditions ([H^l-ll^el are satisfied 
and the estimators 61 and 62 are y^-consistent and asymptotically Gaussian estimators of 

If p > 1, for W = the functions W fp and W f'^ and their derivatives in j3 up to 

order 3, belong to 7ia,d,r with r = R/{R + 1) and hence, according to Tabled 

E II ^1 - \\%= 0(1) (logn)^^^ exp{-2d(logn/(25))'^/^}. 

Example 7. Polygonal relative risk Consider f/s with ff3{z) = 1 — (32a- — /Ssl^P + Piz + 
(32{z — a)l.z>a + Pslz — ftp. This relative risk is except at points a and b where it is 
not differentiable. We suggest to use the smoothing weight function in (|5.1|) as follows. For 
R>0, let 

If the noise satisfies ( IA14I ) with < p < 1, then take R large enough such that r = 
R/{R+1) > p and thus conditions dCiD - JCat as well as conditions ([^}-((C6j are satis- 
fied. Consequently the estimators 81 and 62 are -y/n-consistent and asymptotically Gaussian 
estimators of 6^, with the same asymptotic variance. 

If p > 1 in flAi4|) , the functions Wfp and and their derivatives in f3 up to order 3, 
belong to 'Ha,d,r with r = R/{R + 1) and hence, according to Tabled 

E II ^1 - 0° 11^2= 0(1) (logn)^^T^ exp{-2d(logn/(2(5))'^/^}. 
Comments on the examples [SI |Hl and [7| 

In these three examples, ffsW belongs to 7ia,d,r defined in ( IA15I ) with r at most such that 
r < 1. Hence 9i achieves the y^-rate of convergence provided that fs is ordinary smooth or 
super smooth with an exponent p < 1. It seems therefore impossible to have {W f^)* / f* in 
Li(]R) when the Ej's are Gaussian. This comes from the fact that for these relative risks, the 
least squares criterion SQ0 g{9) cannot be estimated with the parametric rate of convergence 
and hence could probably, not provide a -^/n-consistent estimator of 6^. Nevertheless, even in 
cases where the parametric rate of convergence seems not achievable by such estimators, the 
resulting rate of the risk of 9i is clearly infinitely faster than the logarithmic rate predicted 
by Table ^ that we could have without W . 

In most of previous examples where the weight function W is required, the points where 
ff3{z) has to be smoothed do not depend on /?. But in survival data analysis the relative risks 
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//3 are usually of the form f/3{z) = f{f5z) (see for instance Prentice and Self (|1983j) ). In such 
models, the points where fp^z) has to be smoothed (as function of z) will depend on (3. 
Let us present such examples. 

Example 8. Polynomial relative risk 2 Let be of the form f{j3z) with f{z) = 1 + 
Y^^=i^kZ^ with known Ofc's. Let W{z) = exp{— z^/(4(5)}. Then conditions dCiD - JUs] ) as 
well as conditions ( |C4D -( [n6] ) are satisfied. Consequently the estimators 6i and 62 are ^/n- 
consistent and asymptotically Gaussian estimators of 6^, with the same asymptotic variance. 

Example 9. Cosines relative risk 2 Let fp be of the form f{(3z) with f{z) = XlJLi ^os{jz) 
with known a^s such that XlJLi ~ W{z) = exp{— 2^/(4(5)}. Then the conditions 

jjCiD - dnsI ) as well as conditions ( |C4D -( [n6] ) are satisfied. Consequently the estimators 9i and 
02 are -y/n-consistent and asymptotically Gaussian estimators of 6^, with the same asymptotic 
variance. 

Example 10. Cauchy relative risk 2 Consider fp of the form f{f3z) with f{z) = l/(l+z^). 
Let W{z) = (1 + z^)^exp{— z^/(4(5)} or 1^ = 1. With these choices of the functions 
fpW^ fpW and their derivatives in j3 up to order 3 belong to Tla,d,r defined in ( [AisD with 
a = 0, d = 1/ (3 and r = 1. According to Table Q if fe satisfies ( IA14I) with < p < 1, then 
61 and 02 are -^/n-consistent and asymptotically Gaussian estimators of 6^, with the same 
asymptotic variance. If fs satisfies (IA14D with p > 1 and then 61 and 62 are consistent with 
a rate that depends on P^. Let us be more precise. According to the proof of Theorem 13.11 
for j = 1, . . . , m + p, the term B^ j{6^) are of order exp(— 2(7^//?'^) and the term Vnj{9^)/n 
are of order C72a+(i-p)+(i-p-) exp(-2C7„//30 + 26C^)/n. 

Set C* that realizes the best compromise between the squared bias and the variance terms. 
It is independent from and is given by 



n 

This choice yields to the rate 



logn (2a + (l — /logn 

log 



26 26p 26 



ffi = max < n ^, exp 



2 /logn 2a + (1 — /))_ /logn 

log ' 



(log n 



/30 V 25 25p ^\ 25 

In other words, if p = 1, then E || 9i -9^ 11^^= 0(1) max |n-\ n-V(/3'''5)(logn)2°/(/3'''5) | and 
ifp> 1, thenE || ^1-^° ||22=0(l)exp [-2(/30)-i (log n/(25))i/^ . 

6. Comment on the use of the least squares criterion 

In a proportional hazard model without errors, the main drawback of the least squares 
criterion, compared to the partial log-likelihood, is that it does not allow to separate the 
estimation of f3^ from the estimation of the baseline hazard function rj. The subject of this 
part is to motivate the choice of the least squares criterion when the covariate is mismeasured. 

First, as it is mentionned in the introduction, the partial log-likelihood related to the 
filtration given by the observations only has not an explicit form. 

Second, consider as a partial log-likelihood related to the failure hazard function defined 
by jni) 

iV^ Tl f Eifp{Z,)\a{U„lT,>t) I 
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This partial log-likelihood depends on the observations, on the density g of Z and on r]^o, 
through K[fp{Z)\a{U,lT>t)]- Hence the estimation of also depends on t] through the 
conditionning. 

Lastly, since the Zi's are unobservable, one other idea would be to estimate (3^ by mini- 
mizing Ln\p, f/^"^) given by 



n 



i=l 



T.UYmf?W)^Kn,cSUj 



W^Kn,cAUi)dN,{t), 



for W a suitable chosen weight function, with W{z) ^ for all z in M. Due to the unobserv- 
ability of Wif^M'"'^) 

can be seen as an estimation of the expectation of (|1.3j) . Under 
reasonnable assumptions, j3^^ such that Ln\j3^^, = is a consistent estimator of P^. 
The main difficulty lies in the study of its rate of convergence. As in the study of 9i, the rate 
of convergence of depends on the smoothness of {fj^^W){z)/{ffs){z), as a function of z, 
through the behavior of the ratio 



it) 



as t 



oo. 



Consequently, the best properties would be obtained for W such that (/^^^W^)/(//3) is in 
Li(M) and has the best smoothness properties. In the Cox model, fp'\z)/ff^{z) = z and 



this estimation criterion provides -^/n-consistency and asymptotically Gaussian estimator, 
analogously to the Nakamura's (|1992|) 's estimator. The same result holds for the relative 
risks considered in Examples 01 and IHl Nevertheless, for general relative risks, this criterion 
is less tractable than the least squares criterion (|,'-{.2|) . since it is strictly more difficult to 
"smooth" z 1-^ f^\z)/ ffj{z) than z i— > fp{z). This appears in a crucial way in the model of 

excess relative risk where fj^\z)/ffj{z) = z/(l + This point has to be related to the 

difficulty and even the impossibility to find a suitable correction of Ln\p, J/^"^), which leads 
to asymptotically unbiased score functions (see the introduction). 



7. Proofs 

Prom now C denotes any numerical constant and C{A) indicates that it depends on a A. 
7.1. Proof of Theorem IXTl 

7.1.1. Consistency. By classical arguments, the consistency follows from the two points : 

1- The quantity Sqo g{6) is minimum if and only \i 6 = 6^. 

2- For all 6' G G, E[5'„,i(6i) - Seo^g{9)f = o(l) as n ^ oo, with Seo^g{e) defined in (Ol) . 



3- If oj{n,p) denotes u{n,p) = sup {|S'n^i(0) — S., 



\6 — 6'\\i2 < p} , there exists p/. 



tending to 0, such that E[a;(n, p^)] = 0{p\) as n ^ cx) VA; G N. 



Proof of 1- Under ( [AeD , by applying 1)2. 3|) we get 



d_ 
'dp 



2 / E 
'o 



'fj^\z,)ri,it) {r],it)MZ) - 77^o(t)/^o(Z)} VF(Z)y(t)' 
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and 

The matrix of second derivatives equals 



with 



2 / E 

Jo 

2 E 



dt 







2 / E 



/^o (Z) V (Z) (r?« (t))TTy (t) 







{v^^hmv^'ht)Vfpoiz)wiz)Yit) 



dt. 



An obvious appUcation of Cauchy-Schwarz InequaUty gives that the matrix H is non negative 
definite and hence under ( IA7D 1- is proved. 
Proof of 2- 

For both the bias and the variance, we wih give two upper bounds, based on the two 
following applications of the Holder's inequality 

(7.1) 

and 



< y^i,^2 > I <|| ^1 II2II V2 II2, 



(7-2) \ <^l,^2>\ <\\ 9^1 llooll V52 111 ■ 

According to Lemma l8. II we write that 

nSn,im = MW) * Kc„ {ZW,{t)Y{t) - 2{fpW) * Kc^ {Z)r^,{t)fp, (Z)7?^o {t)Y(t)] dt, 

Jo 

and hence 

E{Sn,i{e))-SeoJ9) = I j\^^{t)l,>t{fxA^r),{flW)*Kc^- flW)dxdt 

-2 j l\.^o{t)r]^{t)l,>t{fpo{-)fx,z{x,-),{fpW)*Kc„- fpW)dxdt. 
By applying (|7.1|) we obtain the first bound 

\EiSn,iie))-Seo^g{9)\ < [J\^^m)\\fx,zhHflW)^Kc„-iflW)\\2 

+ {2 l^\^{t)v-yo{t)dt) I II f(,o{-)fx,z{x,-) II2 dx II {fpW)*Kc„ - fpW 
Applying Parseval's formula we get 

\E{Sn,im-Sg0^g{e)\ < (2^)-l( rr?2(t)di) ||/x,z||2||(/|H^r(i^£„-l)||2 



+(7r)-i( r r,^{t)r,.^,{t)dt) I \\ /^o(-)/x,z(x, •) II2 dx \\ {fpWr{K*c^ - 1 



that is 



(7.3) |E(5„,i(0))-5,o,,(0)| <C(7,7°,//3o) 



{flwnK*c^ - 1) II2 + II {fpWnK*c^ - 1) II2I . 
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According to (|7.2j) . \K{Sn,i{0)) — 5go is also bounded by 



(/>)*i^c„-(/>) 



fx,zix,-) 111 dx(^ J rj^{t)dt^ 
+ II {fpW)^Kc^ - {fpW) lU (2 j II /^o(-)/x,z(x,-) 111 dx j\^{t)r^^,{t)dt 
<\\ {flwnK*c^ - 1) 111 ((27r)-i II 111 f v'.m) 



+ II (//3T^)*(K^„ - 1) 111 (vr-iy II /^o(.)/x,z(x,-) 111 dx ^^{t)fp,r^^,{t)dt 
This implies that 



\nsn,im-Se^, 



< 



+ 



ri'^{t)dt\ II {flwr{K*c^-l) 111 
E|/^o(Z)| [\^it)rj^oit)dt\ II -1) 111, 



that is 



(7.4) |E(5„,i(e))-5,o,g(0)| <C(7,7°,//3o) 

[II (/;3VF)*(i^5„ - 1) 111 + II ifpWnK*c^ - 1) 111] . 
By combining the bounds ^l.'A^ and (|7.4|) we get that 

(7.5) |E(5„,i(^)) - 5,0,^(0)1 <C(7,7°,//3o) 

X minjll if/3WriK*c^ - 1) h + II " 1) lb, 

II {fpWnK*c^ - 1) 111 + II {fpWnK*c^ - 1) 111} . 

By applying Lemma 18.21 

\nSnAO)) - Seo,g{e)f = 0(c-2-+i--+(i-)- exp(-2dC;)) = 0(1). 

Study of the variance Since the random variables are i.i.d., we get that 

(2 + 0(1)), 



Var[S„,i(e)] 



n 



with 



A, 



E 



if'pW)*K^,cAU) / v'it)Y{t)dt 
Jo 

U(sW)*Kr.,cAU) r r,^{t)dN{t) 



and A2 = 4E 



According to (|7.2j) and by applying Lemma l8.H Ai is less than 



ri'^{t)dt) / \{fxAx,-)*fe,{{flW)^Kn,cS)\dx 



< 



r,l{t)dt 



fx,z{x,-)*fe lloo dx II {f^W)i<Kn,C„ 111 



and hence 



Ai < (27r)'i( / vl{t)dt] II /, llooll fx,z \\i 



/I 
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In the same way, we get a first bound for A2- Let us denote by 

(7.6) ^{X,Z) = f\.,{t)dN{t). 

Jo 

According to Lemma IHTTl and to (|7.2() . A2 is bounded by 

4 J {{^\x,-)fx,z{x,-))*fe,{{ff^W)*Kn,cJ^)dx 

<^j\\ {V\x,-)fx,z{xr))^fe lloo dx II {fpW)^Kn,Cr. 

Since 



we get that 



^\x,-)fx,z{x,-)\\idx = ¥. / ri^{t)dN{i) 



V\x,-)fx,z{^r))*fe\\oodx<\\fe\\oo^ / Vj{t)dN{t) 



Consequently, 

VI2 < 4 
It follows that, 
(7.7) Var[5„,i(e)] < 



(2^)-1e( v^{t)dN{t)) II /, 
C(^°, II fe Hoc 



n 



K* 2 

fe 



+ 



K* 2 

Je 



According to H7.2() . j4i is also less than 

[jy^{t)dtf j |(/x,z(x, •) * ((/^i^) ^e:„,cJ'>| 

< (^jy^{t)dtf II y /x,z(x,-) 111 dx II (/|iy)*ir„,c7„ II 

In the same way A2 is less than 
4 y {{v\x,-)fx,z{^^-))*feA{fi5W)^Kn,cS)dx 

<Aj\\ {^\x,-)fx,z{x,-))*fe 111 II {fpW)^Kn,C„ 

where <^{X, Z) is defined in ()7.6|) . Once again, since 

(99^(3;, .)/^_2(:r, •))*/. ||i(ix = E( [\y{t)dN{t)Y, 



(7.8) 



Var[5„,i(0)] < 



C(6'°) r k;. 2 

fe 1 



71 



By combining (|7.7|) and (|7.8|) . we obtain that 
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According to Lemma 18.21 

Var[5„,i(0)] = 0(c2("-«)+i-^+(i-^)- exp(-2dC; + 25C^)/n)^ 

and hence under dSH), E [Sn,i{0) - Sgo^g{9)] = o(l), as n ^ oo. 
Proof of 3- 

By definition Sn,iid) — Sn,iid') equals 



2 

n Jo 



[{fpW)*Kr,,c„mv^{t) - {fp'W)^Kn,C^{Ui)r]yit)] dNi{t) 

[{f|W)*Kn,cAU^W^it) - ifl,W)*Kr,,c„mv'yit)] Y,{t)dt. 



1 

+ - 

n 



Under ^K^ , d^TH ) and for C„ satisfying (gSl), since ||6' - 9'\\p < pk, 

we get that E(|5„,i(e) - = Hence 3- follows. □ 



7.1.2. Rate of convergence. Denote by S^\{9) and sl^\{9) the first and second derivatives 
of Sn,i{9) with respect to 9. By using classical Taylor expansion and the consistency of 9i, 

+ Rni9i - 6*°), with Rn defined by 

(7.10) 



we get that = 5g(^i) = + sl^}{9'm 



Rn 



[S^n\{0' + s{9^-9'))-S^S,{9')]ds. 



This implies that 

(7.11) e,-9' = -[S^;^l{9') + R„]-'S):;, 

Consequently we have to check the three following points 



I) E 



ii) E 



o(l), 



iii) Rn defined in ()7.1U|) satisfies E(|| i?„ = o(l) as n ^ cxd. 



iv) E||0i- 



\% < 4E 



isi!]{9')V 



00 5'„,i 



+ 



The rate of convergence of 9i is thus given by the order of si^J(eO) - sJJU^°) = si'iC^O). 



Proof of i) 



According to (jS^, 5^^|(6'°) equals 
(7.12) 



1=1 



(4!,V) * K„,cJf/i)v(*)^^i(*) + / iU°fpoW) * i^„,c„(C/i)<o(t)l^i(t)dt 



(i)i 



V" 



Study of the bias By definition, K{dSn,i{9) /d(3)g^go equals 



-2E 



{f^^M)*Kn,cAUi)ri^o{t)dNi{t) 



+ 2E 



(//jo/^oV) * Kn,c„{Ui)7j-'^omi{t)dt 
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Hence, according to Lemma l8.H 
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E 



dSn,i{e) 

dp 



-2E 
-2E 
-2E 
-2E 



V(Zi)(/(jV)*i^„,c„(C/i) / v'o{t)Yi{t)dt 



{ffSof^^o^W)*Kc„{Z,) I r,t^,{t)Yi{t)dt 



Since 



dp 



-2E 



r 7f Jt)Y^{t)dt +2E rif Jt)Y^{t)dt 



0, 



we get that E (i95n,i(6')/9/3|g^go) also equals 



- 2E 



fpo{Z,)[{&M)^KcSZ,) - (4'oV)(Zi)] r r,l^,{t)Y,{t)dt 



+ 2E 
-2 



(V(-)/x,z(x,-),[(4oV)*Kc;„-(4i^VF)]) rit^om.>tdtdx 
In the same way, according to Lemma l8. 11 



E 



2E 

+2E 

-2E 

+2E 

-2E 

+2E 



{fpoW)^Kn,cAUi) rj^^J{t)dNi{t) 

{floW)*Kn,cAUi) rv^o{t)r,^o{t)Yi{t)dt 
Jo 

fpo iZi){fpoW) * K^,c„ (Ui) y ' v^^^ it)VyO it)dt 
fpo{Zi)ifpoW)*Kc„iZi) r v^'Jit)vY^it)Y,it)dt 



{floW)^KcSZi) / r/(i,)(t)r?^o(t)yi(t)<it 



Since 



-2E 

+2E 
0, 



/^o(Zi)(VTy)(Zi) 7^^^}r^^o{t)Yi{t)dt 
ifpoW)iZi) [\^^,\t)rj^oit)Yiit)dt 
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we obtain that E {dSn,i{0)/dj Ig^go) equals 

+2E [ifloW)^KcAZi) - {fpoW){Z,)] J\^Jit)ri^o{t)Yiit)dt 

+2 J (/x,z(xr),[(/|ot^)*ifc„-(/|oW)])^\:Jo^(t)v(*)I->*^*'^^- 
A first bound for tliis bias term can be obtained by writing tliat for j = 1, - ■ ■ ,m 

ii/2)E {dSn,mm\o=eo) 

is bounded by 

f^^^w)*Kc„ - (^o^^Vf) II2 (1 II //3o(-)/x,z(x,-) II2 dxJ\^^om 



+ 



< 



{f^pJj(3oW)^Kc„ - [f^JjpoW)\\^ II fxA^r) II2 dxJ\^^oit)dt 
{fpojWy {K*c^ - 1)||^ ((2^)"'/ II //3o(-)/x,z(x,-) II2 dx J^\'^o{t)d?j 

(4i//3oVF)* {K*c^ - ((27r)-i / II •) II2 dx f rf^o{t)dt 



(l/2)E(a5„,i(^)/57,|,=,o) 



In the same way, 
is bounded by 

II {fpoW)i.Kc^ - fp.W II2 II fp^{.-)fx,z{x, ■) II2 dxj^^ \r,^:^l^{t)\n^o{t)dt 

+ II {fl,W)i<Kc^ - fjoW II2 II •) II2 dxj^ \ri%it)r,^oit)\ d?j 

<\\ {fpoWr{K*c„ - 1) II2 ((27r)-^| II fpo{-)fx,z{^r) II2 dxj^^ \v%{t)\v^o{t)dt 

+ II ifpoWr{K*c^ - 1) II2 ((2^)"'/ II II2 dx£ \v%{t)vAt) 

Consequently 

fdSn,ii9) 



dt 



(7.13) 

and 
(7.14) 



E 



< c{e^) [|| (i^£„ - 1)||^ + II {U^l^fp^wy iK*c^ - 1) 



E 



dSn,iiO) 



< c{e') [II [f^wy (K5„ - 1)11, + II (fl^wy {K*c^ - 1) 
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A second bound for the bias term can be obtained by writing that (1/2)E {dSn,i{9)/df3j 
is bounded by 



< 



\\fpo{-)fx,z{x,-) Widx rvUt)dt) 



fx,z{x, •) Widx / %Q{t)dt 



[f^^}wy {K*c^-l)\\^ {{'^^r' j II 111 dx l^\'^o{t)dt 



+ 



fj^^JpoW) {K*a„-l) ((2vr) 



fx,z{x, •) 111 dx / 1] o{t)dt 



In the same way (1/2) \K {dSn,iiO)/d^j |g=go)| is bounded by 



II {ff,oW)*Kc„-ff,oW 



II fi3o{-)fx,z{x,-) 111 dx I li^^ .{t)7]^o{t)dt 



+ II {floW)^Kc„- floW Hoc (y II fxA^r) 111 d^;^ 



(x,-) 111 dx I li^^ {t)ri^o{t)dt 



<\\ {fpoWnK*c^ - 1) 111 ( (2^)-^ / II /^o(-)/x,z(x,-) 111 dx j r^^']mri,o{t)dt 



+ II {f^oWnK*c^ - 1) 111 ( {27T)-' I II fx,z{xr) 111 dx I 7i^^J/t)v^o{t)dt 



Consequently 



(7.15) 



and 



(7.16) 



E 



E 



dSn,i{9) 
dp, 



dSn,m 

9lj 



<C{e^) \ {f%W) {K*a^-l) + [r;/,f,oW) (A-£„-l) 



=6»o 



<C(0O) \\{f^oWr{K*c„-l)\\^+ {fpoWr{K*c„-l) 



By combining (fTn)) . (fTTH) . (fTT^I) and (fTT^ we get that 



(7.17) 



E 



9^n,l(g) I 



< C(0° 



X mm 



4o>) {K*c^-i) + (i^5„-i) 



<■(!) 



and 
(7.18) 



E 



dSn,m 



=00 



mm\\\{fpoWnK*c^ - + {floWr{K*c^ - 1) 

\\{ff,owr{K*c„ - + {f^ownK*c„ - 1) 
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According to Lemma 18.21 



and 



E 



E 



dp, 



=00 



O (c-'^c.+i-r+ii-r)- exp(-2(iC7;; 



Study of the variance 

For the variance term, it is easy to see that 



Var I — — Ig^go = [Vij + V2,j\ , 



with 



and 



yi,,=E 



n 



In the same way 



Var 



[1/30 



Jo 

dSn,i{^)\ _ 8 + o(l) 



n 



with 



and 



V4,j = E 



_ 2 



According to Lemma l8. 11 



and 



y3,:i < 



1 2 



vloit)dt 



ri%{t)v-yo{t)dt 



fx,z{x, •) *fe,[[ 4'o|,//3oVF) * Kn,C,. 



By applying the inequahties (|7.1jl and (|7.2|) we get that 

2 



and 

V3J < 



uo 



%o{t)dt 
X min jll lloo 

Tl^^l,{t)r]A^)dt 



On 



X min < II fe 



* K* 



fsoW 
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Now, according to Lemma l8. II we have 



25 



and 



^2,,- < 



dx 



dx, 



{X,Z)= I v^o{t)dN{t) andy,4j{X,Z)= I r]':^,\.{t)dN{t). 



where 

¥'2 1 

By applying the inequahties <\7.1\i and (|7.2() we get that 

2 



V2,i < E 



rj o{t)dN{t) 



X min < II fs 



and 



V4,j < E 



V%{t)dN{t) 



1 2 



X min < II /e 



fi 



ifpoW) 



/I 



The result follows by combining the bounds on the Vkj's for k = 1, ... ,4 and by applying 
Lemma l8?2l to get that 



Var 



dSn,iie) 



The proof of 3) follows by choosing that realizes the trade-off between the squared 
bias and the variance. 



Proof of ii) 



According to (jSUl), S^^\{e^) equals 



with 



(-5*1^1)1,1 



i=l 

1 " 



+-E 

n ^ 



i=l 



9/3- 



■2 le^ 



=eo *Kn,aAU^) / T]\{t)Yi{t)dt 
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n 

i=l 



dy 



2 l6i=e" 



Y,{t)dt. 



Under for Cn satisfying (jSH), E[5i^J(0O) - s'j^} ^{9^)]'^ = o(l). Hence ii) is proved. 

Proof of ill) 

The proof of iii) follows by using the smoothness oi (3 ^ Wffs and /3 i-^ Wf^ up to order 3, 
the smoothness of 7 i— > r/^ and 7 77^ and by using the consistency of 9i. 

Proof of iv) 

Let us introduce the random event En = rij^kEn,j,k, where 



(jj such that 



dOjdek 



- 2 dOjdOk 



We first write that 



30||2 



< nPi-eYp'^E^ + 2snv\\e\\lnE'n) 



According to H7.1U() and (|7.1H) we have 



-2 



< C{m,p) sup 



< C{m,p) sup 



1 61=610 



ddjddk 



E 



It remains thus to show that P(-E^) = o((^^) with 



supE 

3,k 



dOjdek 



=00 



We write 



m+p m+p 

j=i k=i 



By Markov's inequality, for q > 2, 



1 d^SgoJd) 



-1 



2 '^=^" 



E 



g'(<ggO,g(^)-<gn,l(g)) 



6»=6»o + {Rn)j,k 



In other words, using that \a + h\'^ < 2'^ '^{\ay + |&|''), we get 
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is less than 



2<?-i 

+229- 



E 



E 



\e=eo 



E 



E 



dOjdOk 



E 



1 61=610 



dOjdOk ''^ 
d9jd9k 



+ {Rn)j,k 



1 61=610 



+ E|(i?„)j,fe|n. 



Now we apply the Rosenthal's inequality (see 
centered variables 



dOjdek 

recalled in Appendix), to the sum of 



dOide 



It follows that 



E 



E 



E 



le=eo 



n 



i=l 



Take g = 4 to get that 



E 



1 61=610 



E 



ddjdek 



dOide 



< C{r) \n'-'E\Wn,i,j,k\'' + n-''^^^''^^\Wn,i,j,k 



\e=e" 



Therefore under the conditions ensuring that 

v^'^So<^ JO) 

E 



< C(4) [n-^E\Wn,i,j,k\^ + n~^E^\Wn,i,j, 



e=eo TTT^ l6»=eo 



de 



7j0k 



we have 



E 



ddjdk ' 



=610 



OjOk 



1 61=610 



0(1), 



0{ipi)=o{^l). 



Now, by using the definition of Rn and the smoothness properties of the derivatives of iW fp) 
and up to order 3, we get that E((i2n)j^) = o{\\0i — O^W^^)^ and we conclude that 



Ell^i - e%'i2 < 4E 



dOjOk 



l6»=6»o 



39 ^9 k ' 



=eo 



5, 



□ 



+o{^l) + o{E[\\e,-eX,]). 

7.2. Proof of Theorem 13.21 : asymptotic normality. According to Theorem 13.11 and 
its proof, under ( |Ci| )-( [n3] ), Vn.j{9^) = 0(1) and the asymptotic normality of 9i follows by 
checking that 

v) S'^^lie^) AA(0,Si), with Si defined in Theorem El 

' n^oo 

Let Hn^ii Hn^i, Gn,i, and Gn,i be the processes defined for all t G [0,r] by 



(7.19) 



Hn,i{s) 
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(7.20) 



(7.21) 



(7.22) 



Hn,i{s) 



^(/^^0^H^)(^.)V(«) 
-|(/^oTy)(Z,),?«(s)^ 



and 



' ^(//3o4;V)*E:„,c„(c/i)r?2o(, 



Gn,i{s) 



v^(/^oT^)(^.)<v(s)v(^)y 

According to (fTTT^ . since Ni{s) = Mi{s) + Ai{s, 6^, Zi) (see (jH^I), we get that 



A1+A2 + A3 + 



with 



i=l ^=l 

A3 = E / [^n,i(s) - i?„,i(s)]dAi(s,0°,Zi) and A^ = Y, [GnM " G„,i(s)]yi(s)cis. 



i=l 



Study of ^1 

The term Ai is a hnear combinations of stochastic integrals of locahy bounded and pre- 
dictable processes, H^^i, with respect to finite variation and local square integrable mar- 
tingales, Mj(-). Consequently, E(Ai) = 0. Denoting by < M > the predictable vari- 
ation process of M we have to verify the two following conditions for all t in [0, r] (see 
( Andersen, Borgan, Gill, and Keiding 1993) ) page 68) : 

LI) Y17=i lo ^n,i{s){Hn,i{s))~^ d < Mi > (s) Si(*)) with Sf(t) a positive covariance 

matrix defined by 

T 



(7.23)S?(t) = 4E 



(/ W W){Z.^r^^o {s)\ ( W) {Z,)v^o (s) 



(i)i 



\{fpoW){Z,)i^^'hs) I \{f(,oW){Z,Wo'{s) 



(1), 



rj o{s)Yi{s)ds 



L2) For all e > 0, Er=i /o ^n,Ks)(^n,^(5))^I||H„,(s)||,.>. d < M, > (5) = 0^(1) . 
Proof of LI) 

Since < Mj >= Aj, we have to prove that for all t £ [0, r], 
(7.24) V / Hn,i{s){HnAs)VYiis)fpo{Zi)7]^o{s)ds ^ S2(t). 

i=l 

We apply the following Lemma, which is a straightforward consequence of the fact that the 
set of functions = {a; 1— > Ia;>t} is a P-Glivenko Cantelli class (see van der Vaart and Wellner 
(HMD- 



Lemma 7.1. For j = 1, 



, m 



sup 

0<i<r 



1 ^ 

-Y^m)fpo^AZ,){fj^lw){Z,)-E[Y{t)fpo{Z){f^^^^^ 

i=l 



P.S. 
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sup 

0<t<r 



sup 

0<t<T 



1 

-Y.Y,it)f^oiZ,)WiZ,) - E[Yit)f^oiZ)W{Z)] 

i=l 

^Y.^^{t)fpo{Zm!ilw){Z,)\'-E[Y{t)fpo{Z)\{f^^^^^ 



i=l 



P.S. 



and sup 

0<t<T 



1 " 

-Y,YMfp-iZ^)WiZ,)f~E[Y{t)\ff,o{Z)W{Z)f 



i=l 



P.S. 



0. 



Thus LI) is checked . 
Proof of L2). We have to check that for all j = 1, . . . , m 



1 



-E 



n 



" /■* r r 1 1 2 



,j=i 



0(1) 



and that for all j = 1, . . . ,p 



1 



E 



n 



n rt 



[f^o{Zi)W{Z,)r^^^l^{.s)] l a) ^^^^^^^fpo{Z,)rj^o{s)Y,{s)ds 



0(1). 



This is a straightforward consequence of Lemma 17.21 bv writing that for j = 1, • • • ,m 



-E 



n 



E / [fpo/Z,)W{Z,)rj.^o{s)\ I| .1) (^^^^(^^^^^^^^|^^^/^o(Z,)r?y,(5)y,(s)ds 

nt 

/ i4o^(z^)w^(z.)v(s)i'//3o(^.)v(^)^*(«)^^ 



n^/ne 



.1=1 



o(l) 



and for j = 1, • • • ,p 



1. 



-E 



re 



rt ^ 




[fln{Z,)W{Z,){r^^,.{s)\ (^^|^^^/^o(Z,)v(^)^i(^)rfs 



1 



< ^E 

en^Jn 

Thus L2) is checked. 



.1=1 



Y / |/Jo(Z0W^(^0l'l<o;,.(5)|V/3o(Z,)v(^)^i(^)^* 



0(1). 



Study of A2 

Since E(742) = 0, we use the following lemma, analogous to Lemma l7. II 
Lemma 7.2. Under ([Ai4D - ([Ai5D , for Cn satisfying it 5'. 6|) thenfor j = 1, 



sup 

0<t<T 



1 " 

- E^^(*)V(^0(4Jiw^) *i^n,c„(t/^) - E[y(t)v(z)(/«.H^)(z)] 



anc? sup 

0<t<T 



\ Y Y^{t)f^oiZi){ff,oW) * Kn,c„m - E[F(t)/J„(Z)W(Z)] 



, m 



P.S. 



P.S. 



0, 



0. 



It follows that A2 = Op(l). 



Study of A3 
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It is noteworthy that the term ^3 can be seen as triangular arrays of row-wise independent 
centered random variables that is 



i=l 



with Y17=i ^ri.i = ^3 — 12(^3). Consequently, the asymptotic normality follows by checking 
that 

v-a) E(^3) = Op(l) 
v-b)Er=iE[(K.)'] - 

n — 

v-c) For all e > 0, Etl^[i^n,^)%v„,,\\,2>e\ ^ (Lindeberg Condition). 
By definition, A3 equals 



y2 
^3 



E 



if^^}W)^Kn,cAU^)-if^fiW){Z,)vy>is)\ 

^ ^ (1), . Yi{s)ff3o[Zi)7]^o{s)d 



Let us start with the study of the variance (v-b). Under dCiD - JHs] ) 



Var 

and 
Var 



^ i=l 



i=l 



0(1), 



0(1). 



It follows that v-b) is checked. 

We now come to the bias term and write that 



E{A3) = E 
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According to Lemma l8. II 

^ E{[(4^oV)^i^cJ^)-(4oV)(z)] v(z)/o^y(.)4(.^^^ ^ 

-2^/n 

^E[[{f^oW)*KcAZ) - {fpoW){Z)] fpo{Z)f^Y{s)v^^^{s)v^o{s)ds}^ 
( E{[(/(iV)^i^cJ^)-(4oV)(Z)] V(Z)/o^y(.),7?o(^^^^ ^ 
■poW)^KcAZ) - {fpoW){Z)] V(Z)/o^y(s)r?(i)(s)v(5)ds}^ 
l!{u)^oW)*KcSz) - {f^fiw){z)J^o{z)fx,z{x,z)) (/o^I,.>,r?2„(5)ds) dx^ 

= -2V^ 

^ J{{f/3oW)^KcM) - {fpoW)iz),fpo{z)) (^£l,,>sv[)^{s)7j^ois)ds) dx ^ 

For j = 1, ... ,111 

{f^o]jW)*KcAz) - {f^fiw)iz),ff,o{z)fx,z{x,z)) (^l\,>sV^^o{s)ds^ dx 



E 



HAZARD FUNCTION ESTIMATION WITH MISMEASURED COVARIATE 



31 



is less than 

^?o(s)ds) I |((4j;^.t^)*Kcj^)-(4o;,vF)(z),v(z)/x,z(x,z)) 

which is, according to <\7.1\i and (|7.2() . less than 



dx 



that is less than 
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In the same way we obtain that for j = 1, . . . ,p 
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Consequently, under JXI^ , EiA^) = o(^c7-'*+(i-0/2+(i-r)-/2exp(-dc;)^_ ^^^^^ jnTT )- 
( IC3I ), Var(A3) = 0(1) and hence C„ can be chosen such that £(^3) = o(l). It follows 
that v-a) is checked. 

In order to check the Lindeberg condition we write that for j = 1, - ■ ■ ,m 
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It follows that v-c) is checked. 

Study of ^4 

The study of ^4, quite similar to the study of ^3 is omitted. 

7.3. Proof of Theorem 14.11 : The proof of Thorem l4.11 quite classical is omitted. 
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8. Appendix 

Lemma 8.1. Let ip be such that K{\ip{X, Z)\) is finite and let ^ such thatE,{\^{U)\) is finite. 
Under the assumptions jAsj ) and jA4| j, then 

E[^(X, Z)$ * Kn,cAU)] = n^iX, Z)$ * KcAZ)], 

and 

E[^{X,Z)<!>^Kn,cAU)? = I {{v\x,-)fx,z{x,-))*fe,{'^*Kn,cJ^)dx. 

Proof of Lemma 18.11 : Set fx,u,z the joint distribution of (X,U,Z). Under ( IA3D and 
( IA4I ), fx,u,zix, u, z) = fx,zix, z)f£{u — z). Hence by the Parseval's formula 



E[v3(X,Z)$*K„,c„(f^)] = 1 1 j (p{x,z)<^-kKn,c^{u)fx,zix,z)fe{u- z)dudxdz 

Lp{x,z)fx,z{x,z) j <^-kKn,c„iu)feiu- z)dudxdz 
(27r)-i // ip{x,z)fx,z{x,z) [ <S>*{y)K*cJy)f:iy)e-^y'dydxdz 



C^^r' II ip{x,z)fx,z{x,z) I ^*{y)^p^f*{y)e-'y^dydxdz 
C^^y^ II ^(.x,z)fx,z(.x,z) I ^*{y)K*cJy)e-'y'dydxdz 
ip{x,z)fx,zix,z) I ^{u)Kc„iz - u)du dx dz 
(p{x, z)^ -k Kc„ {z)fx,z{x, z) dx dz. 

J J 

In the same way, 

E[ip{X,Z)<!>*Kn,cAU)f = lll^\x,z){^kKr,,cMffx,z{x^^)fe{u-z)dxdudz 

Lf'^ix, z){^ -k Kn,c„iu)f fx,z{x, z)fe{u - z)dxdudz 

{{ip\x, ■)fx,z{x, •)) * fe, * Kn,cS) dx. □ 

Lemma 8.2. For a, r two nonnegative numbers, Then 

(8.1) [ \ureM-Muf)du < \ C-'^+'-'eM-^Ci}. 

J\u\>c„ C[u,A,d) 

Furthermore, if satisfies ([Ai4t ), then 

I |n|-exp(-A|n|^) ^^ ^ \,,r(f^ "^^"t^' '^^^-'^ exp{-AC;: + 

Lemma 8.3. Rosenthal's inequality (Rosenthal (1970), Petrov (1995)). For Ui, . . . ,Un, 
be n independent centered random variables, there exists a constant C{r) such that for r > 1, 



i=l i=l i=l 
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